Transform ation and Com bination of Hidden M arkov M odels for Speaker Selection Training
نویسندگان
چکیده
This paper presents a 3-stage adaptation framework based on speaker selection training. First a subset of cohort speakers is selected for test speaker using Gaussian mixture model, which is more reliable given very limited adaptation data. Then cohort models are linearly transformed closer to each test speaker. Finally the adapted model for the test speaker is obtained by combining these transformed models. Combination weights as well as bias items are adaptively learned from adaptation data. Experiments showed that model transformation before combination would improve the robustness of the scheme. W ith only 30s of adaptation data, about 14.9% relative error rate reduction is achieved on a large vocabulary continuous speech recognition task.
منابع مشابه
HHMM Based Recognition of Human Activity Motion Trajectories in Image Sequences
A bstract In this paper, w e present a m ethod for recognition of hu m an activity as a series of actions from an im age sequ ence. T he diffi cu lty w ith the problem is that there is a chicken-egg dilem m a that each action needs to be extracted in advance for its recognition bu t the precise extraction is only possible after the action is correctly identifi ed. In order to solve this dilem m...
متن کاملA Speaker-Independent Continuous Speech Recognition System Using Biomimetic Pattern Recognition
-—— In speaker-independent speech recognition. the disadvantage of the most di行used technology (HMMs.or Hidden Markov models)is not only the need of m any m ore training sam ples,but also long train tim e requirem ent. This PaPer describes the use of Biom im etic pattern recognition(BPR)in recognizing some mandarin continuous speech in a speaker-independent m anner. A speech database was develo...
متن کاملDevelopment of robust speech recognition middleware on microprocessor
W e have developed speech recognition middleware on a RISC microprocessor which has robust processing functions against environmental noise and speaker differences. The speech recognition middleware enables developers and users to use a speech recognition process for many possible speech applications, such as car navigation systems and handheld PCs. In this paper, we report implementation issue...
متن کاملHybrid neural network/hidden Markov model continuous-speech recognition
n M In this paper we present a hybrid multilayer perceptron (MLP)/hidde arkov model (HMM) speaker-independent continuous-speech recognib tion system, in which the advantages of both approaches are combined y using MLPs to estimate the state-dependent observation probabilities p of an HMM. New MLP architectures and training procedures are resented which allow the modeling of multiple distributio...
متن کاملParameterisation of Three-Valued Abstractions
T hree-valued abstraction is an established technique in software m odelchecking. It proceeds by generating a state space m odelover the values true, false and unknown, where the latter value is used to represent the loss of inform ation due to abstraction. T em porallogic properties can then be evaluated on such m odels. In case of an unknown result, the abstraction is iteratively refined. In ...
متن کامل